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Abstract 

This paper deals with the problem of multicasting a set of discrete memoryless correlated sources 
(DMCS) over a cooperative relay network. Necessary conditions with cut-set interpretation are presented. 
A Joint source-Wyner-Ziv encoding/sliding window decoding scheme is proposed, in which decoding at 
each receiver is done with respect to an ordered partition of other nodes. For each ordered partition a 
set of feasibility constraints is derived. Then, utilizing the sub-modular property of the entropy function 
and a novel geometrical approach, the results of different ordered partitions are consolidated, which lead 
to sufficient conditions for our problem. The proposed scheme achieves operational separation between 
source coding and channel coding. It is shown that sufficient conditions are indeed necessary conditions 
in two special cooperative networks, namely, Aref network and finite-field deterministic network. Also, in 
Gaussian cooperative networks, it is shown that reliable transmission of all DMCS whose Slepian-Wolf 
region intersects the cut-set bound region within a constant number of bits, is feasible. In particular, all 
results of the paper are specialized to obtain an achievable rate region for cooperative relay networks which 
includes relay networks and two-way relay networks. 

Index Terms 

Aref network, compress-forward, cooperative relay network, Gaussian network, linear finite-field deter- 
ministic network, multi-layer coding, Slepian-Wolf, Wyner-Ziv. 

I. Introduction 

Consider a group of K sensors measuring a common phenomenon, like weather. In this paper, we 
investigate a communication scenario in which some sensors desire to obtain measurements of the other 
nodes with the help of some existing relay nodes in the network. In the language of information theory, 
we can consider measurements of sensors as outputs of discrete memoryless correlated sources and model 
the communication network as a cooperative relay network in which each node can simultaneously be a 
transmitter, a relay and a receiver. So the problem can be defined as below: 

Given a set of sources C/4 = {U aj : aj £ A} observed at nodes A = {a±, ■ ■ ■ , om} Q V respectively 
(V = {1, ■ ■ ■ , V} is the set of nodes in the network) and a set of receivers at nodes D = {d\, ■ ■ ■ , d^} Q V 
which is not necessarily disjoint from A, what conditions must be satisfied to enable us to reliably multicast 
Ua to all the nodes in T> over the cooperative relay network? 

The problem of Slepian-Wolf (SW) coding over multi-user channels has been considered for some special 
networks. First in (Tj, Tuncel investigated the problem of multicasting a source over a broadcast channel 
with side information at the receivers. He proposed a joint source-channel coding scheme which achieves 
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operational separation between source coding and channel coding in the sense that the source and channel 
variables are separated. He also proved the optimality of his scheme. In a recent work |2], this problem was 
generalized to the problem of lossy multicasting of a source over a broadcast channel with side information. 
In (3j, a necessary and sufficient condition for multicasting a set of correlated sources over acyclic Aref 
networks [4] was derived. The problem of multicasting correlated sources over networks was also studied 
in the network coding literature (5J, (6j. 

Cooperative relay network has been widely studied in terms of achievable rate region for relay networks 
|7j> ®> multiple access relay channels (9j and multi-source, multi-relay and multi-destination networks 
[T0|. In all the mentioned works, two main strategies of Cover and El Gamal for relay channels fTT) , 
namely, decode and forward (DF) and compress and forward (CF) were generalized for the cooperative 
relay networks. In a more general setting | [T2] , Giindiiz, et.al., consider a compound multiple access channel 
with a relay in which three transmitters where, one of them acts as a relay for the others, want to multicast 
their messages to the two receivers. Several Inner bounds to the capacity region of this network were derived 
using DF, CF and also structured lattice codes. Although finding the capacity of the simple relay channel 
is a longstanding open problem, an approximation for the Gaussian relay network with multicast demands 



has been recently found in fT3|-| 15 1. In these works, the authors propose a scheme that uses the Wyner-Ziv 
coding at the relays and a distinguishability argument at the receivers. 

In this paper, we first study the problem of multi-layer Slepian-Wolf coding of multi-component correlated 
sources, in which each source should encode its components according to a given hierarchy. Using the sub- 
modularity of the entropy function and a covering lemma, we prove an identity which states that for any points 
of SW-region with respect to joint encoding/decoding of the components, there exists a multi-layer SW-coding 
which achieves it. To the best of our knowledge, this identity is new and we call it the SW-identity. Then, 
we propose a joint Source-Wyner-Ziv encoding/sliding window decoding scheme for Slepian-Wolf coding 
over cooperative networks. In this scheme, each node compresses its channel observation using Wyner- 
Ziv coding and then jointly maps its source observation and compressed channel observation to a channel 
codeword. For decoding, each receiver uses sliding window decoding with respect to an ordered partition 
of other nodes. For each ordered partition, we obtain a set of DMCS which can reliably be multicast over 
the cooperative relay network. By utilizing the SW-identity, we obtain the union of the sets of all feasible 
DMCS with respect to all ordered partitions. Our scheme results in operational separation between the 
source and channel coding. In addition, this scheme does not depend on the graph of the network, so the 
result can easily be applied to any arbitrary network. We show that the sufficient conditions for our scheme, 
are indeed necessary conditions for the Slepian-Wolf coding over arbitrary Aref networks and linear finite- 
field cooperative relay networks. Moreover, we prove the feasibility of multicasting of all DMCS whose 
Slepian-Wolf region overlap the cut-set bound within a constant number of bits over a Gaussian cooperative 
relay network. This establishes a large set of DMCS that belongs to the set of DMCS which can reliably be 
multicast in the operational separation sense. Note that the model considered in this paper, encompasses the 
model of multiple access channel with correlated sources. So the set of feasible DMCS in the operational 
separation sense is a subset of all feasible DMCS. We extract an achievable rate region for cooperative 
relay networks by reducing sufficient conditions for reliable multicasting. We show that this achievable rate 
region subsumes some recent achievable rates based on the CF strategy (8j, fl6] |. In addition, we estimate 
the capacity region of Gaussian cooperative relay networks within a constant number of bits from the cut-set 
bound. Our result improves capacity approximation of Gaussian relay networks given in |l5j. 

The rest of the paper is organized as follows. In section [TTJ we introduce notations and definitions used 
in this paper. Section III derives necessary conditions for reliable multicasting of DMCS over cooperative 
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networks. Section IV studies the multi-layer Slepian-Wolf coding, in particular, a novel identity related to 
the entropy function is derived. In section [Vj we obtain feasibility constraints which are the main results of 
the paper. In sections VI and VII we derive necessary and sufficient conditions for multicasting of DMCS 
over some classes of semi-deterministic networks and Gaussian cooperative relay networks, respectively. 
Section VIII employs results of the previous sections to derive an inner bound and an outer bound for the 
capacity region of a cooperative relay networks. Section [FX] concludes the paper. 



II. Preliminaries and Definitions 

A. Notation 

We denote discrete random variables with capital letters, e.g., X, Y, and their realizations with lower case 
letters x, y. A random variable X takes values in a set X. We use \X | to denote the cardinality of a finite 
discrete set X, and px(%) to denote the probability mass function (p.m.f.) of X on X, for brevity we may 
omit the subscript X when it is obvious from the context. We denote vectors with boldface letters, e.g. x, y. 
The superscript identifies the number of samples to be included in a given vector, e.g., X % = (X%, • • • , Xj). 
We use T™(X) to denote the set of e-strongly typical sequences of length n, with respect to p.m.f. px(x) 
on X. Further, we use T e n (y|x) to denote the set of all n-sequences y such that (x, y) are jointly typical, 
w.r.t. pxy(x,u)- We denote the vectors in the j'th block by a subscript [j]. For a given set S, we use the 
shortcuts X$ = {Xi : i G S} and R$ = J2ieS ^ e use to denote the set theoretic difference of S 
and T ■ We say that a n < 2 nb , if for each e > and sufficiently large n, the relation a n < 2 n ( b-e ) holds. 



B. Sub-modular Function 

Let V be a finite set and 2 V be a power set of it, i.e., the collection of all subsets of V. A function 
/ : 2 V — > E is called sub-modular, if for each 5, T C V, 

f(S n T) + f(S u T) < f(S) + f(T) (1) 

Function / is called super- modular, if — / is sub-modular. Given two sets S,T and a sub-modular function 
/, we define f(S\T) ± f(S U T) - /(T). 

Let X_a be DMCS with distribution p{xjCj. For each S C A, we define the entropy function h as h(S) = 
H(Xs) where H(X) denotes the entropy of random variable X. It is well-known that the entropy function 
h is a sub-modular function over the set A (17 J. The sub-modularity property of the entropy function plays 
an essential role in the remainder of the paper, (in contrast to the non-decreasing property of the entropy, 
i.e, h(S) > h(T), VTC5). 



C. Some Geometry 

A polytope is a generalization of polygon to a higher dimension. Point, segment and polygon are polytopes 
of dimension 0, 1 and 2, respectively. A polytope of dimension d > 3 can be considered as a space bounded 
by a set of polytopes of dimension d — 1. The boundary polytope of dimension cZ — 1 is called facet. For a 
given polytope P, a collection of polytopes {Pi, • • • , P n } is called a closed covering of P, if P = U™ =1 P,;. 

Lemma 1: Let P be a polytope and F = {Pi, P2, • • • , Pn} be a collection of polytopes with the same 
dimension as P. If P and F satisfy the following conditions: 

1) Vi : Pi C P 
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2) Each facet of P is covered by some facets of some polytopes (Pa, • • • , Pj&). 

3) For each facet of Pj inside P, there is Pj ^ Pj such that Pj and Pj have only that facet as the 
common part. 

then J 7 is a closed covering of P. 
Proof: The proof is provided in the Appendix [A] ■ 
Lemma [T] provides a powerful tool for dealing with the regions which are described with a set of inequalities. 

Definition 1: A point Q = (gi, • • • , q^) in M. d is said to majorize point P = (pi, • • • if % > Pi for 
all z. In addition, point Q is said to majorize set V (denoted by Q >~ V), if there exists a point X G V 
which is majorized by Q. 

It is easy to show that majorization has the following simple property: 

QyV 1 UV 2 ^> QyV 1 orQyV 2 (2) 



Definition 2: Let / be a sub-modular function over the set V. The essential polytope associated with / 

is: 

P/ = {x£ Rl v l : x v = /(V) and VS C V,x s > f{S\S c )} (3) 

where x= [xi,x 2 ,--- ,£|v|] and x s = Y,ieS x i- 

The essential polytope of the sub-modular function / over the set V is a polytope of dimension |V| — 1, 
which has 2' v ' — 2 facets, each corresponding to intersection of hyperplane xj- = f(T\T c ) with P^ 
for each non-empty subset T C V. By Fff, we denote the facet corresponding to the subset T. Since 
g(T) = f{T\T c ) = /(V) — /(T) is a super-modular function, one can easily show that Ffj- is a non- 
empty polytope of dimension |V| — 2 (see for example, [18]) . 

Lemma 2: The facet Ffj- of polytope Pf can be decomposed to projections of Pj on MJ' and MP~ C 
(in which M s stands for the space {x G M' v ' : Vs G = 0}). More precisely, 

F />r = {x G Rl V l : x r G F$-, x r c G F$-} (4) 

where 

F$- = {xel r :x r = /(T|T C ), and VS C T, ar 5 > /(5|5 C )} (5) 

and 

= {x G M TC : x T c = f(T C ), and VS C T C , x 5 > f(S\T C \S)}. (6) 

Moreover, F^- and F^- are the essential polytopes of the functions f% : lT — > R and /2 : 2^ c — >• R 
respectively, where /i(<S) = f(S\T c ) and / 2 («S) = /(«S). 

Proof: The proof is provided in Appendix [B] ■ 
Lemma 3 ( [18]): Let f\ and f 2 be two sub-modular functions defined on a set V. Then, 

P/ 1+ / 2 =Pa+P/ 2 (7) 
where the sum of two sets is defined a.sX + y = {x + y:x<EX,y£ y}. 
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D. System Model 

A cooperative relay network is a discrete memory less network with V nodes V = {1, 2, • • • , V}, and a 
channel of the form 

(Xi,X 2 ,--- ,X v ,p(yi,y 2 ,--- ,yv\xi,x 2 ,--- , x v ), ^i, 3^2, • • • ,3V)- 

At each time i = 1, 2, • • • , every node v G V sends an input X^t £ A^, and receives an output Y v>t G 3^, 
which are related via p(ii,t, • • • , Yy tt \X\^i 

Definition 3 (Reliable multicasting of correlated sources over cooperative networks): Let A and V 

be two subsets of V corresponding to the set of the sources and the destinations, respectively. We say that 
the set of DMCS, Ua, can reliably be multicast over discrete memory less cooperative network, to all nodes 
in V, if there exists a sequence of a pair of positive integers (s n , r n ) such that s n — > oo, r n — > oo, — — > 1 
as n — y oo and a sequence of encoding functions 

fif : 1£" x 3^ ^X v for f = 1, • • • , r n 

at all nodes v G V, where, for the non-source nodes we let U v = $ and a set of decoding functions defined 
at each node di G T>; 

!,,r-':ir, x>-; ; ^ 

such that the probability of error 

vanishes for all di G V as n goes to the infinity. 

According to Definition [3j the joint probability distribution of the random variables factors as, 

s„ r„ V 

j=l t=l v=l 

Remark 1: The network model described in the Definition [3] includes several network models such as 
MAC with feedback, relay networks and multi-way channels (i.e., a generalization of the two-way channel). 

III. Cut-set type necessary conditions for reliable multicasting 

In this section, we prove necessary conditions for reliable multicasting of correlated sources over coop- 
erative network. 

Proposition 1: A set of DMCS Uj± can reliably be multicast over a cooperative network, only if there 
exists a joint p.m.f. p{x\>) such that 

H(U S \U A \ S ) < min min I(X W ; Y W c\X W c) (9) 

Proof: Using Fano's inequality, imposing the condition P^' Tn ^ — > as n — > oo, it follows that: 

VSCV,dieV\S:—H(UX\ YJ« , U% ) < e„ (10) 
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with e n — > as n — > oo. We also have ^H(Ug\U'j£~Y£U%) < e n . For each (W.dj) such that S C 
W C V and G W c , we have: 

Sn 
Sn 

<V^ y w c ^ +£ » (13) 

Sn 

1 r " 

= Y n E / ( C/ J"; rw^il^V^w^W- i) + £n (14) 

i=l 
1 r " 

= — ^ ^(^wc.il^^^^Vc^vvc^) - H(Y w a ii \U^Y^'oX W c ti ) + e n (15) 

Sn . -, 
i=l 

1 r " 

<-E ff(Yvv°,i|^W,i) - H(Y w c yi \U s A "Y v r 1 X VA ) + e n (16) 
Sn i=\ 
1 r„ 

= — ^(^VV,i; ^W c ,i|^W c ,i) + e n (17) 

Sn . . 
i=l 

— I(Xw,q; Y>v c ,qI^>V c ,Q) Q) + e n (18) 



Sr. 



< -/(Iw,q; ^Wc.qI^wc.q) + e n (19) 

Sn 

— t- I(Xyv; ^VV C |^W C ) (20) 

where ([14]) follows from the fact that I w c^ is a function of iXyJc , ^w c nA> ancI ^ e fact tnat n ^ — 



.4\<S, ( [To] ) follows since conditioning reduces entropy, (17 1 follows because (U*£, Yy ) — Xy^ — Yy^ form 



a Markov chain, ( fT8| ) is obtained by introducing a time-sharing random variable Q which is uniformly 
distributed over the set {1,2,- •• ,r n } and is independent of everything else, ( [20] ) follows by allowing 
Sn, r n — > oo with ^ — >• 1 and defining Yy = Yy,Q and Xy = -Xv,q. ■ 

IV. Multi-Layer Slepian-Wolf Coding 

Before describing our scheme and the related results, in this section, we deal with the problem of multi- 
layer Slepian-Wolf coding (ML-SW). Study of the ML-SW enables us to find a new tool to analyze the main 
problem. In the previous works (for example p9| , [20 1), ML-SW is used to describe a source with some small 



components (for example, by a binary representation of it) and then successively encoding these components 
with SW-coding instead of encoding the whole source at once. For example, if we describe an i.i.d. source 
S by (X, Y), i.e., 5 = (X, Y), instead of encoding S by R = H(S) bits/symbol, we can first describe X by 
Rx = H(X) bits/symbol and then apply SW-coding to describe Y by Ry = H(Y\X) bits/symbol, assuming 
that the receiver knows X from decoding the previous layer information as a side information. Since the 
total bits required to describe S in two layers is Rx + Ry = H(X, Y) = H(S), it follows that there is no 
loss in the two-layer SW-coding compared with the jointly encoding of the source components. A natural 
question is: How can this result be generalized to a more general setting of multi-terminal SW-coding? At 
first, let us look at the two-terminal SW-coding. Suppose two sources Si = (X\,Y\) and S2 = (X2, Y2) are 
given. Joint SW-coding yields that lossless description of (Si, S2) with rates (Ri,R%) is feasible, provided 
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Fig. 1. Two-Layer Slepian-Wolf coding for a pair of two-component correlated sources. This coding is suboptimal in the sense 
that it does not achieve the entire of Slepian-Wolf coding. 



that (Ri,R 2 ) € {(n,r 2 ) : n > H{X 1 Y 1 \X 2 Y 2 ),r 2 > H (X 2 Y 2 \X 1 Y 1 ) , r x + r 2 > H {XxX^Y^} . Now 
suppose the following simple ML-SW. Assume in the first layer, X\ and X 2 are encoded by SW-coding 
with rates (Ru,R 2 i) and in the next layer Y\ and Y 2 are encoded by SW-coding with rates (R\ 2 ,R 22 ) 
assuming that the receiver knows (X\,X 2 ) from decoding of the previous layer information (See Fig. [I]). 
The lossless description of (Si,S 2 ) in this manner is possible, if: 

R 1 = R u + R 12 > H(X X \X 2 ) + H{Y l \X 1 X 2 Y 2 ) > H{X 1 Y 1 \X 2 Y 2 ) 
R 2 = R 2i + R 22 > H{X 2 \X X ) + H{Y 2 \X 1 X 2 Y l ) > H{X 2 Y 2 \X l Y 1 ) 
Ri+R 2 > H{X X X 2 ) + HiY^X^) = H{X X X 2 Y X Y 2 ) 

This shows that this simple layering can not achieve all the points in the SW-region, in particular the 
corner points A = (H(X 1 Y 1 \X 2 Y 2 ),H{X 2 Y 2 )) and B = (H(XxYi),H(X 2 Y 2 \XiYx)) can not be achieved 
by this scheme(See Fig. [2]). But the point A can be achieved by successive SW-coding of X 2 , Y 2 , X\ and 
Y\ regarding that the previous sources are available at the receiver. This method suggests that instead of 
dividing the SW-coding in two layers, SW-coding can be performed in three layers: in the first layer X 2 is 
described for the receiver with rate R 2 \ > H(X 2 ), in the second layer (Y 2 ,X\) are encoded by SW-coding 
in the presence of X 2 at the receiver, and finally in the last layer Y\ is described using SW-coding assuming 
(X 2 ,Y 2 ,X\) are available to the receiver. Analyzing this strategy, yields that (Ri,R 2 ) are achievable if, 

Ri = R n + Rxi > H(Xi\X 2 Y 2 ) + HiYxlXxX^) = H^Y^X^) 
R 2 = R 21 + R 22 > H(X 2 ) + HiY^X^) > H^Y^X^) 

Ri + R 2 > H{X 2 ) + HiXxY^X^ + HiY^X^Xx) = H^X^Y^ 

From this strategy, the corner point A is achieved, but the corner point B is not achieved. In addition, as it 
can be seen in Fig. [2] the other corner point of this scheme (C) is coincident with one of the corner points 
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H{X 2 ) + H{Y 2 \X 1 X t ) 
H(Xi\Xt) + H(Kj| X 1 X,Yi) 




HiX^XzYi) H(X 1 \X 2 )+ ff(J?i)+ H{X^\) 
HiY^XiXaYi) HfY^Xi) 



Fig. 2. Slepian-Wolf rate region vs rate regions with two and three layers Slepian-Wolf coding. Segments AC correspond to the 
three-layer SW-coding, in which in the first layer, X2 is encoded, then in the second layer (Y2, Xi) is encoded assuming that X2 is 
already decoded at the receiver and in the third layer Y\ is encoded assuming that (X2, Y2, Xi) is already available at the receiver. 
Segment CD corresponds to two-layer SW-coding of the Fig.[T] Segment DB is obtained from a similar three layer SW-coding to 
that of segment AC. Notice that each corner point of any multi-layer SW-coding that lies inside the SW-region is coincident to a 
corner point of another multi-layer SW-coding. 



of the two-layer scheme. By symmetry, the corner point B is achieved by a three-layer scheme in which 
X\, (X2, Y\) and Y2 are encoded in the first, second and third layer respectively. In addition, as it can be 
seen in Fig. [2j the union of the regions of the three different layering schemes is a closed covering of the 
SW-region. Note that in all the three schemes, there is a hierarchy in the sense that the first component of 
each source (i.e., Xi) is encoded prior to the second component of it (i.e., Yj). The result of the two-terminal 
SW-coding suggests that to obtain the entire SW-region of multi-components DMCS, it suffices to consider 
all possible layering schemes such that a given hierarchy on each source is satisfied. 

Definition 4: An ordered partition C of a set V is a sequence [£±,£2, ■ ■ ■ ,£k] of subsets of V, with 
union V, which are non-empty, and pairwise disjoint. Denote the family of all ordered partitions of a given 
set V, by Jy. 

Consider a DMCS SV with two component sources, i.e., S v = (X v , Y v ). Now we describe ML-SW with 
respect to a given ordered partition C = [£1, • • • ,£r]- In addition, we assume that the decoder has access 
to side information Z which is correlated with (Xy,Yy) according to an arbitrary distribution p{x\>, yy> z). 

1) In the first layer, using SW-coding, X^ is encoded with rates R\ = (Ru, R12, • • • , R\v) in which 
for v ^ £%, we set R\ v = 0. The receiver can reliably decode X£ provided that 



V5 C Ci : Ris > H(X S \X CAS Z) 
Define the function he 1 '■ 2 V — > M as 



(21) 



h C:1 (S) = H(X SnCl \Z) 
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Now using the sub-modularity of the entropy function, we have 

h c ,i (S nT) + ftc 1 i(5uT) = H(X Sf]TnCl | Z) + H(X (SuT)nCi | Z) 

= #(^(SnA)n(mCi)l^) + H{X(SruC 1 )u(TnCi)\Z) 
< H(X SnCl \Z) + H(X TnCl \Z) 

= h c ,i(S) + h Cjl (T) (22) 

Hence hc,i is sub-modular. In addition, we have: hc,i(S\S ) = H(X\; ri c 1 \Z) — H(Xsc n c 1 \Z) = 
H(X s \X Cl \ s , Z). Note that Ri S = Risnd, thus ((21]) is equivalent to 

V5 C V : Ris > h c 4S\S c ) (23) 

Now it follows from Definition [2] that Ri is contained in the SW-region of the first layer, iff it majorizes 
the essential polytope of he i, i.e., R\ >~ Ph c i- 
2) In the layer 2 < i < K + 1, assuming that (X% , Y^i-i) has been decoded at the receiver from the 
previous layers (where D = if^L^Ck), using SW-coding (X2 ,Y^_) is encoded with rates Ri = 
(Ril, Ri2: • • • , Riv) m which for v A_i U £j, we set Ri v = 0. The receiver can reliably decode 
(XI, YU provided that, 

VS C A_! U d : Ris > H{X SnC Y Sn c^ 1 \X CAS Y Cz _ AS X c ,Y c ^Z) (24) 

Define the function hc,i ■ 2 V — > M. as follows: 

hcAS) = HiXsrXtYsrXt-AXvYct-rZ) 



Now in similar manner to ( [22] ), it can be shown that hc,i is sub-modular. Following similar steps 
described in the previous stage, we conclude that Ri is contained in the SW-region of the layer i, iff 
it majorizes the essential polytope of hc,i, i-e-, Ri >~ P/i c ( . 
Define R = Rk (which is the overall rate vector) and he — ^2^=1 hck- We showed that R >~ P/i c - 

On the other side, suppose that the point R majorizes Pfc c , so there is a point R* G Ph c such that R >~ R*. 
Applying Lemma [3] to (hc,k '■ 1 < k < K + 1), we have = Ylik=i ~ P/ic k - Hence there are points 
(R* k e V hck : 1 < k < K + 1) such that R* = Y,k=i R %- Let R k = K + where AR = R- R*. 
Now we have i? = ^2^=1 Rk and for all k, R^ >~ P/j c k . Thus, each rate vector R satisfying R y P^ c can 
be achieved using ML-SW coding with respect to C. Therefore the set of all achievable rates with respect 
to C is given by: 

K c = {R£RW :Ry-Ph a } 

K+l 

= {R€ :VSQV,R S >J2 H(X Sr , c Y Sr ,c l _ 1 \X CAS Y^_ AS X Yc^Z)} (25) 

i=l 

The next theorem, is the main result of this section. 

Theorem 1 (SW-identity): The set {IZc ■ C G Jy} is a closed covering of Ksw which is the SW- 
region defined by: 

Usw = {R€ M |v| :VSCV,R S > H(X s Y s \X s cY s c Z)} (26) 
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Proof: Define the function h : 2 V -> R with /t(«S) = fZ"(X 5 y 5 |Z). ft, is a sub-modular function with 
the essential polytope P/j. By definition, a point i? belongs to SW-region iff it majorizes P^. To prove the 
theorem, we must show that 

n sw = U nc W 

Applying Equation @ to the RHS of ((27]) yields, 

J = {R G KM : i? >- U P U (28) 

Thus, to prove the theorem, we only need to show that {Pfe c : C G -Fy} is a closed covering of P^. 
We prove this by strong induction on |V|. For N = 1 as base of induction, it is clear (The case N = 2 
was proved separately in the beginning of the section). For |V| > 2 assume that the theorem holds for any 
V with size |V| < N — 1. We show that {Ph c '■ C £ ^v} an d Pfe satisfy the conditions of Lemma [TJ thus 
{Ph c '■ C G -7~v} is a closed covering of P^. 

Claim 1: For any ordered partition C of V, we have 

Phc C Pfc. 

Proof of Claim [7J First note that, (See equation (24 1) 

^ C («S|«S C ) = J] ^(^nr.^nA-xl^A^^-AS^^-^) (29) 
i=l 

> ^THiXsncYsnc^lXscYscXsnaYsno-iZ) (30) 

i=l 

= i7(x lS y 5 |x lS cy 5 cZ) (3i) 

= h{S\S c ) (32) 

where ([30) follows from the fact that (£A 5 ) u C 5 C U (5 n £*) with equality holds if 5 = V and 
conditioning does not reduce the entropy, and ( |3T] ) follows by the chain rule, since {CiC\S}f =1 is a partition 
of S. Now we can conclude the claim from ( [32] )- □ 

Claim 2: Suppose F-yc j- is a subset of V that consists of all ordered partitions which are generated by 
concatenating an ordered partition of T c and an ordered partition of T, i.e., 

T r cj = {C G F v : C = [Ci, C 2 ], Ci G J" r ° and C 2 G J" r } 

Then, the set of facets {F^ j : C G J7-C 7-} is a closed covering of F/, .7-. 
Proof of Claim [2]By Lemma [2] F/^7- is given by: 

F ftjT = {x G M V : x r G , x r c G P^ 2 } (33) 

In which P^ and P^ 2 are the associated essential polytopes of sub-modular functions h\(S) = H{X$Ys\Z) 
and fi2(S) = H{XsYs\X-ycY-yc Z) with domains 2^° and 2 T , respectively. More precisely, P/ ll and P/j 2 
are given by: 

P hl = {x G M TC : x T c = i?(X rC y r c|Z), and VS C T C ,x 5 > (X s Y s |X s0nrc y s0nr cZ)} 
P h2 = {x e M r : x T = H(X r Y r \X r cY r cZ), and VS C 7>s > # (X s Ys|X s0nr Y SCnr X rc y rc .Z)} 



10 



Now, since the size of T c and T are smaller than N, by applying the induction assumption to essential 
polytopes Pf ll and P^ 2 (with side information Z = (Xfc , Yfc , Z) at the decoder), we obtain: 

P^= U P fcl.Ox 

Ci &T T c 

P h2 = |J P h2C2 (34) 
c 2 eJY 

where Ci = [£1,1, • • • , £1 k x ], C2 = [£2,1, • • • , C-2,K 2 ] an d the functions foi,Ci an d ^2,c 2 whose domain 
are 2' and 2' , are defined by: 

/n, Cl («5) = J] ^sn^^n^.J^^- 1 ^ (35) 

k=l 
K 2 + l 

h 2 ,c 2 (S) = E H ( X SnC 2 , k YsnC^\X ci Y cr rZ) (36) 
fe=i 



Using ( f35) and ([36]), we obtain and P^ 2 c as: 

P hi ci ={x G M TC : x T c = H(X T cY T cZ), and VS C T C 

* 5 > E ^(^5nz: 1 ,^5nz: 1 ,,_ 1 |^£ I ,A5^: I , fc _A5^ y cr^)} (37) 
fe=i 

P h2 C2 ={x G M r : x T = H(X T Y T \Z), and V5cT 

K 2 +l 

fe=i 

Let C = [£1,1, • • • , £i,Ki j £2,1 j • • • ,£2,if 2 ] b e tne concatenation of Ci and C2. We assert that 

F ho>T = {x G R V : x r c G P hliai , x r G P^ } (39) 
By Lemma [2j x belongs to F/ tc 7-, iff 

5 Cf c : x 5 > /ic(5|r c n 5 C ) with equality for S = T C 



5Cf: x 5 > /i c (5|5 G ) with equality for S = T 



To evaluate ( |40| ), consider 

fc=i 

K 2 +l 



(40) 



+ F(X lSn£2 , i y 5n£liA . i |X r cy £ , 1 Z) + £ H(X S nC 2 , k Ysnc 2 , k .AX ck Y c ^Z) (41) 



fc=2 
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where we have used the fact that C^ 1+l = T c . Now, we compute the RHS of (40 1: 

h c (S\T c n S c ) = hciT ) - h c (T c n S c ) 

= HiXc^Yc^X^Y^Z) - H{X S a nC ^Y S a nC ^_ x \X ci Y cki -,Z) (42) 
fe=i 

= H { x sr\C 1 , k ysr\C 1 , k - 1 \X Cl ^\sYc 1 , k - 1 \s x ci Y c k 1 - lZ ) (43) 

k=l 

K 2 +l 

h c (S\S c ) = ^(^n£ 2 , fc W 2 , fc _ 1 |^£ 2 , fc \ l s>£ 2 , fc _ 1 \-s^ y £ 2 fc - 1 ^) (44) 

k=l 



where ( |42] > follows, because S C T c and T c are disjoint from all £2,1. (44 > follows from the fact that T 
is disjoint from all 



Now ( |37| ), ( |38| ), ( |43| > and ( |44| ) together show the truth of assertion. Finally, the assertion with ( |34| ) implies 
that for each point x G F/^7-, there exists an ordered partition C € Tfo 7- for which F/^ 7- contains x. 
This completes the proof of Claim 2. □ 

Claim 3: For each facet Fh c ,T of an essential polytope of a given ordered partition C inside the P^, 
there exists an ordered partition C* / C, such that 



P hc f]P hc ,=F 



ho,T 



(45) 



Proof of Claim [IJ Let C = [Ci, ■ ■ ■ ,Ck\- From the proof of Claim |2j the corresponding facets to 
(Cf = Llf =i Ck '■ k > 2) lie on the boundary of P^. Thus, we only consider the facets corresponding to 

' C* K , C* K+l ], where £* = (T n U {T c D 

F ftc „r°- (46) 



T ^ Cf = U« =i £ k . For such T, set C* = 



Now we show that 



F h c ,T 



This proves Claim [3j because P/j c gets the minimum of xj- on the F^f, and Pft c gets the maximum of 
xj- on the F^j-o (since X7- = ff(XyYy|Z) — a*r°)- 

We provide the formal proof of (|46j> in Appendix [C] Instead, we give the main idea behind the construction 
of C*. First, consider the simple SW-coding of a DMCS X\> with rate-tuple Ry. It is well-known that the 
minimum of Rj- is achieved with joint decoding of Xq-c with sum-rate Rfo = H(Xj-c\Z) followed by joint 
decoding of Xj- in the presence of X-jc at the decoder with sum-rate Rj- = H(X-r\Xfc Z). Also, Lemma 
[2] confirms this result about any sub-modular function. Moreover, this lemma tells us that each point which 
achieves the minimum of Rj- can be obtained by this two-level decoding. Now consider the ML-SW coding 
with respect to C. Each point of ML-SW region can be written in the form R = Rk, where R k lies 

in the SW-region of layer k. So Rj- can be split into the rates Rut = R. 



■fc,rn(/: fc u£ fe _i)- 



Thus to minimize 



Rj, we require to minimize each of -Rfc,Tn(£ fc u£ fc _i)- I n layer k, SW-coding is done over (Xc k ,Yc k _ 1 ), 
therefore to achieve the minimum of ^fc,Tn(£ fe u£ )c _i)> it suffices to consider two levels of decoding at the 
decoder: the decoder, first decodes (Xj-c n £ h , Yfc n £ k _ 1 ) in the presence of (X ck ,Y^ k -i, Z), then decodes 
(X TnCk , Y TnCk _ 1 ) in the presence of (X Ck , Y C k-i, X T c nCk , Y T c nCk i , Z). In overall, to minimize R T with 
respect to C, one can consider the following 2K + 2 levels of decoding: 



x T c r\C t , Xrnd , ( x T c nC k , Y T c nC k - 1 ) , ( x TnC k , *rn£ fc _i ) ,"■ , Y T c nC K , Y TnC K 



(47) 
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On the other side, to maximize Rj- (or equivalently, to minimize Rq-c) with respect to C*, the following 
order on SW-coding is required, 

XrnciiX-j-cnc*,- ■ ■ , (AYac*, Ymc^J, (^r c n£^ *T c n££_i)> " " " > Y TnC K+1 , Yr c nc* K+1 (48) 

Now, note that T c C\C* k = T c C\{(TC\C k -i)U{T c C\C k )) = T c nC k and TnC* k = Tn((Tn£ k -i)U(T°n 
C k )) = T n in particular TnC\= T c C\£* K+l = 0. Comparing ((47) with (|48), we see that these two 

multi-level decoding schemes are the same, thus the intersection of P/j c and Ph c , is f?h c ,T = ^h c , T c- ^ 
Now Claims [T]-[3] ensure that P^ c and satisfy the conditions of Lemma [T] This completes the proof. 



V. Feasible Constraints for reliable multicasting of a DMCS over cooperative 

NETWORKS 

In this section, we obtain a set of DMCS which can reliably be multicast over a cooperative network. Our 
approach is based on generalization of the CF strategy for relay networks. Two types of generalization have 
been considered in the previous works, JSJ, p2j . In (SJ, the CF strategy was generalized in the following 
manner: 

1) Each relay and destination partially decode the messages of the other relays. 

2) Each relay compresses its observation Y v , in the presence of side information from messages of the 
other relays. 

3) Each relay sends its compressed observation through a Multiple Access Channel (MAC). Finally, 
destination decodes the source message. 

This scenario deals with relays in a symmetric way, i.e., all relays lie in a single MAC layer. In p2j , a 
generalization of mixed strategy of [ 1 1 , Theorem 7] is proposed. By relaxing the partial decode-and-forward 
part of the mixed strategy, we obtain a generalization of the CF strategy. In this scenario, relays are ordered 
according to a given permutation. Each relay compresses its observation using multiple description method 
(MD) and sends these descriptions through a broadcast channel with a degraded message set. Each relay 
and destination decode their respective descriptions after decoding their broadcast messages according to a 
sequential decoding scheme. However, if the relays use the simple Wyner-Ziv coding rather than MD, the 
result is a special case of (8] Theorem 3]. In another scenario proposed in p3| , CF is generalized for half- 
duplex channels. Although, this method is proposed for half-duplex relay networks, it can be generalized 
for relay networks, too. In this scenario, each relay uses the simple Wyner-Ziv coding. This scenario differs 
from the previous generalization of CF, in which the destination considers an ordering of relays, and decodes 
the compressed observation of relay k, in the presence of compressed observations of relays (k + 1, k + 
2, • • • , N — 1) which are decoded in the previous blocks. This is similar to ML-SW coding. 

We propose a joint source coding and Wyner-Ziv coding for multicasting a DMCS over cooperative 
networks. In this scenario, in each block, each node compresses its observation using Wyner-Ziv coding, 
then in the next block jointly maps the compressed observation and its current source sequence to a channel 
input codeword and transmits the codeword. The joint encoding used in this scheme, benefits from the 
advantage of joint source-channel coding in comparison with source-channel separation in the multicast 
scenario, which is illustrated in [1]. Moreover, in this scheme, each node has two types of sources including 
the compressed observation and the source sequence which are required to decode at each destination. By 
the nature of relaying, it is not possible to decode these sources, simultaneously. This situation is similar to 
ML-SW coding, in which two components of the source are not being decoded simultaneously. Motivated 
by the results of ML-SW coding, e.g., Theorem [TJ each destination groups the other nodes into some layers 
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according to its ability to decode the information of other nodes. Using insights from the ML-SW, in the 
first level of decoding, the destination can directly decode the first component of the information of nodes 
in the first layer, i.e., the source sequences of the first layer, through a MAC layer between layer one and 
the destination, and in level k of decoding, destination decodes the source sequences of layer k and the 
compressed observations of layer k — 1 (second component of information of layer k — 1) jointly through 
the MAC layer between layer k and the destination in the presence of the decoded information from levels 
(1, 2, • • • , k — 1) as side information. These side information play two roles in improving the decoding: 

1) These are side information for Slepian-Wolf coding that enlarge the SW-region. 

2) These are side information for MAC that enlarge the MAC -region. Unlike the first role, this role does 
not arise from the ML-SW. 

Enlarging the SW-region and the MAC -region provides the opportunity for some intersection between the 
two regions which results in the reliable transmission of source sequences of the nodes in layer k and the 
compressed observations of the nodes in layer k — 1 in an operational separation sense, even if the original 
MAC region does not intersect with the original SW-region. 

The next theorem is the main result of the paper. 

Theorem 2: The set of DMCS U A can reliably be multicast over a cooperative network to nodes in V, 
if there exist auxiliary random variables Yy and Q, such that for each S C A, we have 

H(Us\U A \ s ) < min min [I(X W ; Y di Y W c\ {di} \X w aQ) - I{Y W] Y w \X v Y di Y W o\ {di} Q)\ (49) 
where the joint p.m.f. of random variables factors as 

q)]p(yv\xv)- (50) 



Remark 2: The constraint (49 1 separates source coding from channel coding in the operational separation 



sense [1]. To see this, observe that the constraint ( |49| ) is equivalent to the following constraint, 

VW C V,di G W c : H(U WnA \U A \ w ) + I(Y w ;Y w \X v Y di Y W a\ {di} Q) < I(X w ;Y di Y W a\ {di} \X W oQ), 

(51) 

Consider a cut A = (W,W C ). The RHS of the ( |5"Tj ) provides an achievable flow through the cut A. The 
first term in the LHS of ( |5Tj ) represents the rate of the Slepian-Wolf coding for describing Uyvc\A to the 
destinations in the other side of the cut in the presence of U A \yy which is available in W c . The second 
term in the LHS of ( |5T| ) can be interpreted as the rate of the Wyner-Ziv coding for describing a compression 
of the observation lyy, i.e. Yyy to the other side of the cut in the presence of (A"yyc , Yyy/c ,Y di ) and Xyy, 
which the latter can be regarded as the output of channel decoder. Since the compression rate of the sources 
is less than the information flow, one can expect that the multicasting of the sources is feasible, due to the 
source-channel separation approach. 

Proof of Theorem^ For the sake of simplicity, we assume that \Q\ = 1 where Q is a time-sharing 
random variable. First, we characterize a set of DMCS which can reliably be multicast over a cooperative 
network, with respect to given ordered partitions at each destination. For each destination node di, let 
V-di = V\{di}. The following lemma, establishes a set of sufficient conditions for reliable multicasting of 



U A over the cooperative network. We provide the proof of it in Subsection V-A 

Lemma 4: The set of DMCS U A can reliably be multicast over a cooperative network to subset V of 
the nodes, if for each di G V, there exists an ordered partition C^*) = [Ci, C%, ■ ■ ■ ,Ce] of V- di such that 
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for each S C V-d 4 , the following constraint is satisfied: 

e+i 

J2 H ( x t) + #(*t|X^) > J2 {H(U S nc k \U Ch \ S Uc*U di )+ 
teS k=i 

H(Xsnc k %nc k - 1 \X Ck \sYc k _ 1 \s x c k Yc k - 1 Yd l X d .j^, (52) 

where the random variables (xv,yv>yv) are distributed according to ( [5Q] >. 

This lemma gives a partial solution to the problem of reliable multicasting of U A over the cooperative 
network, in the sense that to find out that the multicasting of C/4 is feasible, we must consider all the 
possible ordered partitions of each set V- di an d check that the constraint (52 1 is satisfied or not. If for 
each destination node di, there exists at least one ordered partition of V- dt such that ([52]) is satisfied, then 
reliable multicasting is feasible. Since the number of ordered partitions of a set V grows rapidly with |V|, 
such approach (checking the constraint (52 1 for all the ordered partitions) seems to be difficult. However, 
using Theorem [TJ we show that there exists a set of constraints that unifies the set of constraints ((52]) with 
respect to all the ordered partitions. The following lemma, establishes such a result. 

Lemma 5 (Unified Sufficient Conditions): For a destination node di, there exists at least one ordered 
partition C( di > of V- di for which the constraint (52 1 is satisfied, if and only if the following constraint is 
satisfied, 

VS C : H ( X t) + H(Y t \X t Y t ) > H(Y s X s \X s cY s cY di X d J + H(U s \U s aU di ). (53) 
tes 

Proof of Lemma^ For each v G V, define R v = H{X V ) + H(Y V \Y V X V ) and 

RW = (R u ... ,i? di _ 1 , J R di+1 ,-.. ,R V ). 



Consider the RHS of ([52]). Since random variables C/4 and (Xy,Yy,Y\;) are independent, the constraint 
(52 1 can be rewritten as 

e+l 

V5 C V- di : 4"° > ^H(U Sn c k X SnCk Y S nc k jU Ck \ S X Ck \ s Y Ck _ AS Uc k X ck Y c ^U d Y d M. (54) 

k=l 

The RHS of ([54]) can be expressed in the form of ([25]) with V = V- di , X v = (X v , U v ), Y v = Y v and 
Z = (Y di , X di , U di ), thus the constraint (52) is equivalent to R^ £ lZ C (d z ). Therefore for the node di, 
there exists at least one ordered partition of V- dt such that ( [52] ) is satisfied, iff R^ G ^c^eTv ^c< d .'- 
Applying Theorem [I] we conclude that such exists iff (53) is satisfied. □ 

The constraint ( [53] ) can be rewritten in the following form: 

V5 C A\{di} :H(U S \U A \ S ) < mm R$ ] - H(Y w X w \X w cY w c\ {di} Y di ), (55a) 



d i €W c 



VS C A c \{di} :Rf ] - H(Y s X s \X S cY S c\ {dt} Y di ) > 0. (55b) 



Consider the constraint ([55]). In Appendix [Dj using the joint p.m.f. ( [50] ) we will show that this constraint is 
equivalent to the following constraint 

VS C A\{di} :H(U S \U A \ S ) < nun [I(X W ; Y d Y w a\ {di} \X w c) - I(Y w ;Y w \X v Y di Y w c\ {di} )], (56a) 
VS C ^ c \{^} :I(Y s ;Y s \X v Y S c\ {di} Y di ) < I(X s ;Y S a Wi} Y di \X S c). (56b) 
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The first constraint (56a I is the same as the constraint (49), so we only need to show that the second 
constraint ( |56b| ) is an additional constraint. The second constraint represents a sufficient condition for 
reliable multicasting of the compressed observations of the non-source nodes to the destinations. Since 
the destinations only need to decode the sources and do not need to decode any other information, it is 
logical to neglect the second constraint, which completes the proof of Theorem [2] We provide a rigorous 



proof of this fact in subsection V-B 



A. Multi-Layer Slepian-Wolf coding over a cooperative network (Proof of Lemma^ 

We transmit s n B = nS-length source over cooperative network in B + 2V — 3 blocks of length n where 
V is the cardinality of V. Observe that r n s = n(B + 2V — 3) and — > 1 as B — > oo, thus the sequence 

{(s n B, r n B)}%° = i satisfies the condition of Definition [5] 

Codebook generation at node v: Fix < e" < e' < e. Also fix 6 > such that | T?(U V )\ < 2 n( - H ^ +5 \ 
To each element of T™(U V ), assign a number w v G [1, 2 n( - H( - Uv ^ + ^] using a one-to-one mapping. Moreover, 
for each non-typical sequence, set w v = 1. Denote the result by u v (w v ). For channel coding, independently 
repeat the following procedure V times. Denote the resulting k-th codebook by C v (k). 

Choose 2 n ( f ^ f/ ") +/ ( r " ;yi, l X ") +25 ) codewords x v (w v ,z v ), each drawn uniformly and independently from 
the set T™,(X V ) where z v G [1, 2 n ( I ( Yv '' Yv \ x ^ + ^\. For Wyner-Ziv coding, for each x v (w v ,z v ) choose 
2n(i(Y v ;Y v \x v )+6) codewords y v (z' v \x v ), each drawn uniformly and independently from the set T"(Y v \x v ) 
where z' v G [i,2 n W Y ^ Y M+% 

Encoding at node v: Divide the nS-length source stream u™ B into B vectors (u^ui : 1 < j < B) where 
u v,\j] = { u v,(j-i)n+i, " " ' > u v,jn)- We say that the channel encoder receives = (m v m, ■ ■ ■ ,m v \B]), if 
for 1 < j < B, UyN-i is assigned to m v u] G [1, 2 n ( H ( u ^ + ^]. Encoding is performed in B + 2V — 3 blocks 
where in block b, we use the codebook C v (b mod V). For 1 < b < B + 2V — 3, define: 

_ / m V)[b _ v+1] ,V <b<B + V-l 
v > [b] \ 1 .otherwise. 

In block 1, a default codeword, x„(l, 1) is transmitted. In block b > 1, knowing z v ^ b _i\ from Wyner-Ziv 
coding at the end of block 6—1 (described below), node v transmits yLv(w v ™, ^«j6-i])- 

Wyner-Ziv coding: At the end of block b, node v knows (x„ ,[6-i], y v ,[6-il) an d declares that z„,[6-i] = z v 
is received if z v is the smallest index such that (y?;,[b-i](^l x D,[6-i]) 5 x i.,[f)-i]iyD,[fe-i]) are jointly typical. 
Since we have more than 2 nl ^ Ynj ' Y ^ x ^ codewords, such a z v exists with high probability. (See Table [i] which 
illustrates encoding for a network with four nodes in which node 4 is only a destination, i.e., ^4 = ^4 = 0.) 

Decoding at node d{.- Let = [C\, ■ ■ ■ ,£g] be an ordered partition of the set V-d z = V\{d%}. We 

propose a sliding window decoding with respect to C^ d '\ Define s v rw = (w v m, z v ,[b-i])- Suppose that 
( s £i,[fe-i]) s c 2 ,[b-2]> " " ' j s c e ,[b-e]) have been correctly decoded at the end of block 6—1. Node di, declares 
that (sc u [b], ■ ■ ■ , sc ej [b-e+i}) has been sent, if it is a unique tuple such that for each 1 < k < I + 1 satisfies 
the following conditions, 

(x-Cfc [6-fe+l])) y£fc_i ( z C k - U [b-k+l] \ :x -C k - 1 ,[b-k+l])i :x -£ k ,lb-k+l] 7 

yc k -\[b-k+i],yd t ,[b-k+i],^di,[b-k+i]j G T c \ for all k such that k < b 

{uc k {w Ck ,[b-k+i\)^ C k(w C k , b _ k+1 i),u di (wd it [b-k+i\)) e T l\ for a11 k such that V <b- k + 1 <V + B - 1 

(57) 
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TABLE I 

Encoding Scheme for Multicasting of two blocks of source sequences over a network with V = {1, 2, 3, 4}, 

A — {1, 2, 3}, V = {3, 4} AND NODE 4 HAS NO CHANNEL INPUT, I.E., W 4 = X 4 = 0. 



Node 


Block 1 


Block 2 


Block 3 


Block 4 


Block 5 


Block 6 


Block 7 


1 


xi(l,l) 

yi(*i[i]l x i[i]) 


Xl(l,2Tl[l]) 
yi(^l[2]|xi [2] ) 


Xl(Ml[2]) 
yi(^l[3]|xi[ 3 ]) 


Ul ("*![!]) 
Xl (mi [!],*! [3] ) 
yi(zi[4]|xi[ 4 ]) 


ui(mip]) 

Xl(mi[2],2l[4]) 
yi(2l[5]|xi[ 5 ]) 


Xl(Ml[5]) 

yi(zi [6] |x 1[6] ) 


Xl(Ml[6]) 

yi(«i [7 ]|x 1[7] ) 


2 


x 2 (l,l) 
ya(z2[i]|x 2 [i]) 


x 2 (l,z 2 [i]) 

y2(«2[2]|x 2 [2]) 


X 2 (l, Z 2 [2]) 
y 2 (22[3]|x 2 [3]) 


U 2 ("l2[l]) 
X2(m 2 [l],^2[3]) 
y2(«2[4]|x 2 [4]) 


X2(m 2 [2],^2[4]) 
y 2 («2[5]|x 2 [5]) 


X2(1,Z 2 [5]) 
y 2 (z2[6]|x 2 [6]) 


«2[B]) 
y2(z 2 [7]|x 2 [7]) 


3 


x 3 (M) 
y3(z 3 [i]|x 3 [i]) 


X 3 (l,2 3 [l]) 
y3(23[2]|x 3 [2]) 


X 3 (l,2 3 [2]) 
y"3(z3[3]|x 3 [3]) 


u 3 (m 3[1] ) 

X3(m 3 [ 1] ,Z3[3]) 
y3(«3[4]|x 3 [4]) 


U3(m 3[2 ]) 
X3(m3[2],2 3 [4]) 
y3(z 3 [5]|x 3 [5]) 


X3(l,«3[5]) 
y 3 (^3[6]|x 3 [6]) 


X3(l,2 3 [6]) 
y 3 («3[7]|x 3 t7l) 



TABLE II 

ILLUSTRATION OF DECODING SCHEME OF FOUR-NODE NETWORK DEPICTED IN TABLe[T] AT NODE 4 WITH RESPECT TO THE 
ORDERED PARTITION C (4) = [{1, 2}, {3}] AT THE END OF BLOCKS 2 AND 5. THE GRAY CELLS HIGHLIGHT THE RANDOM 
VARIABLES CORRESPONDING TO THE UNKNOWN INDICES AND THE YELLOW CELLS HIGHLIGHT THE RANDOM VARIABLES 
AVAILABLE AT DECODER WHICH WILL BE USED FOR DECODING OF THE UNKNOWN INDICES THROUGH A JOINT TYPICALITY 
CONDITION BETWEEN THEM AND THE GRAY RANDOM VARIABLES. 



Node 


Block 1 


Block 2 


Block 3 


Block 4 


Block 5 


Block 6 


Block 7 










uiK[i]) 


Ui(mi[2]) 






1 


XI (1.1) 


Xl(l, «![!]) 


Xl(l,2l[2]) 


Xl(«li [3]) 


xi(m 1[2 ],zi[ 4 ]) 


Xl(l,Zl[5]) 


Xl(l,Zl[6]) 




yi(«i[i]|xi[i]j 


yi(zi[2]|Xl[ 2 ]) 


yi(zi[3]|xi[ 3 ]) 


yi(zi[4]|xi [4] ) 


yi(zi[5]|Xl[ 5 ]) 


yi(zi[6]|Xl[ 6 ]) 


yi(Zl[7]|xi[7]) 










U2(m 2 [i]) 


U2(m 2 [2]) 






2 


x 2 (l,l) 


X2(l,2 2[1] ) 


X2(1,Z 2 [2]) 


x 2 (m 2 [i],z 2 [ 3 ]) 


X2(r71 2 [ 2 ],Z 2 [4]) 


X2(1,Z 2 [5]) 


X 2 (1,Z 2 [6]) 




ya(2a[i]ix a [i]) 


y 2 (2 2 [ 2 ]|x 2 [2]) 


y2(z2[ 3 ]|x 2 [ 3 ]) 


y2(z2[4]|x 2 [4]) 


y 2 (z 2 [5]|x 2 [5]) 


y2(z 2 [6]|x 2 [6]) 


y 2 (z2[7]|x 2 [7]) 










U3(m 3[1] ) 


" 3 ( m 3 [2]) 






3 


x 3 (l,l) 


X 3 (1,Z 3 [1]) 


X 3 (1,Z 3 [2]) 


x 3 (m 3 [i],z 3 [ 3 ]) 


X 3 (m 3 [2],Z 3 [ 4 ]) 


X 3 (l,Z 3 [ 5 j) 


X 3 (1,Z 3 [6]) 




y3(z 3 [i]|x 3[1] ) 


y3(«3[2]|x 3 [2]) 


y 3 (z 3 [ 3 ]|x 3[3] ) 


y 3 (z 3 [4]|x 3[ 4]) 


y 3 (z 3[5 ]|x 3[5] ) 


y 3 (z 3 [6]|x 3 [6]) 


y 3 (Z3[7]|x 3 [7]) 










U 4 [4] 


U4[5] 






4 


y*[i] 


y4[ 2 ] 




y 4 [4] 


y4[5] 


y 4 [ 6 ] 


y4m 


Decoding 








_ m {l,2},[l] 


™'{1,2},[2]>'^3,[1] 






at node 4 





%,2},[1] 


%,2},[2]4 3 ,[1] 


Z{1,2},[ 3 ],Z 3 ,[2] 


Z{1,2},[4],«3,[3] 


%1,2},[6]>«3,[4] 


Z{1,2},[6].Z 3 ,[5] 



where s Ck t [b-k+i] = {wc k ,[b-k+i\^c k \b-k})- Note that at the end of block b + V + i — 2, the vector 
w A,[b+v-i] = m A,[b] is decoded. Since each (u„ rw : v G A) is jointly typical with high probability, we find 
the source sequence r b i with small probability of error. Hence at the end of block B + V + £ — 2, u'^ 3 
is decoded with small probability of error. 

Note that in the first V — 1 blocks of decoding, no sources information is decoded. The advantage of 
decoding compressed observation in these blocks is to provide side information at the receiver, in order to 
improve decoding in the next blocks. 

Example 1: Consider the four-node network of Table [TJ Here we assume that node 4 observes source U± 
correlated with the other sources. Let = [{1, 2}, {3}]. Decoding at node 4 begins at the end of block 
2. In block 2, node 4 declares that £r 1)2 }m is decoded if (xi(l, % m), X2(l, ^2,[l])> y4[2l) anc ^ 
(yi(%,[i]|xi[i]),y2(22,[i]|x2[i]),x 1[1 ](l,l),X2[i](l,l),y 4 [i]) are jointly typical. In the next block, (%i, 2 },[2]^3,[i]) 
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are decoded and in block b, ("^{l^},^] ? %i,2},[6— i]i ^3,[6— i]j ^3,[6— 2]) WQ decoded, if (See Table |TTJ> 

( u {l,2}(w{l,2},[6]),U 4 (u> 4 [b])) G I? (58) 

( x {i,2}(^{i,2},[6]^{i,2},[fe-i]),y4[fe]) e T™ (59) 

(U3(^3,[6-1]),U{1,2},[6-1],U4(W4[6-1])) G ( 60 ) 
( x 3(^3,[fe-l])^3,[b-2]) 5 y{l,2}(^{l,2},[6-l]l x {l,2}[6-l])i x {l,2}[6-l]iy4[6-l]) G T™ (61) 
(y3(^3,[b-2]l x 3[b-2])> x {l,2,3}[6-2] ; y{l,2}[6-2] ; y4[6-2]) G T™ (62) 

£rror Probability Analysis: Let U„ p,_y + ]i be the observed sequence at node v, which is used for 
encoding in block b. We bound the error probability of decoding at the end of block b averaged over 
A[b-V +ih U.4[&-V]) • • ■ , U A[b-i-V+2\) an d ai l random codebooks, assuming that no error occurred in 
the decoding of the previous blocks. Let S v u\ = {W v uh ^«h-i])> i n which W v u\ and Z v u_i^ are the indices 
of U v>[b _y +1 ] and Y„ j[6 _x], respectively. Define S b = (S Cl [b],' •; , S Cf \b-t+i})- Also, let s = (s Cl , ■ • • , s^J, 
in which s v = {w v ,z v ) : w v G [1, 2 n ( H ( t/ ») +a )], G [1, 2™( / ( y -^l X ")+< 5 )]. Define the events, 

£o(b,k) := {(Vc k ,[b-k-V+2],'^C k ,[b-k-V+2],'^d l ,[b-k-V+2]) ^ T e} 

:= {(X Vi ^ k+n ,Y Vt ^ k+n ,%(z v \X Vt[b _ k+1] )) i T%, for all z G [1, 2 »« i ^)+< 5 )]} 
£ 2 (b,k,s) := {(uc k (wc k ),U ck}[b _ k _ v+ 2],\J di}[b - k - V+ 2}) G T™} 

£ 3 (&, fc,s) := {(X£ fc (s£j, Y£ fc _ 1 (z£ fe _jX£ fc i [ 6 _ fc+1 ]),X £fc j b _ fe+1 ], Y £fc -i [ 6 _ fc+1 ], Y d . > [ fe _ fc+1 ],X 4 j b _ A;+1 ]) G 

(63) 

Then the error event £ (b) corresponding to decoding at the end of block b can be expressed as 

m = u£tl M&, fe) U u ^v£i(&, fc, u) (J 4^ fc, S 6 )) |J u s ^s 6 ( n e k t\ £ 2 (b, k, s) n £ 3 (b, k, s)) . 

Using the union bound, we bound above the probability of error as follows: 

F[£{b)} < P[ujg£b(&, fe)] + P[u£i U, ev fe, u)] + P[u£i(£f (&, k, S b ) f| n„ 6V «P(6, "))]+ 

P[u s ^ Sb ( n't* f 2 (b, k, s) n f 3 (6, fc, s))]. (64) 

By the typical lemma [31, Theorem 1.1], the first term vanishes as n — > oo, the second term vanishes since 
at each node v and for each input x„ there are more than 2 nI ( Yv ' Yv \ Xv ) codewords yv{zv\Xv,[b-k+i\)> 

and the third term vanishes by 1 3 1 Theorem 1.2.]. For the last term, let £i(b) = U s ^s b ( ^k=i ^2(b,k,s) n 
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£ 3 (b,k,s)). 

< ^2p(u A[b _ e _ V+2 ]) • • ■ p{u A[b _ v+1] )P[S b = Sb\uA[b-£-V+2}, ■ ■ ■ ,U A[b _ v+1] 



W-A[b — t-V + 2] ,'" V + 1] ,Sb 



E P Kt 1 i^(6, fc, s)|u^ [6 _ £ _ v+2] , • • • , u # . v+1] ]PKt^3(6, fc, s)|S 6 = s 6 ] (65) 



s^s b 

= X^P( u ^[6-f-y+2]) • • • P( u A[b-V+l])^[Sb = S b \u A [ b _ e _ V+2 ], ■ ■ ■ ,U A [ b _ v+1 ]] 

^■A[b—i—V+2]j'" , u A[b- V + 1] ,Sfo 

E II P ^ 2 ( 6 ' s )l u ^[6-fc-v+2]]P[f3(b, fc, s)|S 6 = 8 6 ] (66) 

S^Sf, fc=l 

= y]p{ u A\b-e-V+2\) • • ■ P( u A[b-V+l])^[ S b = s b\^A[b-(-V+2] , ' ' ' i u A[b-V+l]\ 

VLA[h—£—V+2])"* , u A[b- V + 1] ,Sfo 

e+i 

E II iK^C^J.^^fc-v+a],!^,^.^]) G r e n ]P[£ 3 (&,M|S 6 = s 6 ] (67) 

S^Sb fc=l 



where ( |65j ) follows from the fact that the codebook generation is independent of the sources U A , ( |66| ) 
follows from the fact that the codebooks used in any I < V consecutive blocks are generated independently 
and the fact that the sources are i.i.d., therefore the source sequences are independently generated in the 
consecutive blocks and (67i follows from the definition of £2(6, k,s), in which 1 represents the indicator 
function. Define, 

Ns,z{W-A\b-e-V+2}) ■ ■ ■ > u A[b-V+l]> s b) = js : S t / St,[6_fc + 1]> Z V / ^i', [&-&]) 

for all k£ [l,i + l],teSnC k ,t' £ZnC k , and s Ck \s = s Ck \s,[b-k+i],for all fc G [1,4 (68) 
and (u jCfc (u; i cJ,u jC k ) [ 6 _ Jfc _v+2] ! u tlit [ 6 _ fe _v +2 ]) G T™, for all fc G [M]}. 
Then, ([67]) can be rewritten as, 

< E^M^-V+2]) ' ' , P( u >l[6-V+l]) 1P [ S 6 = S6|U t/t p_/_v +2 ], • • • ,U^ [fe _y +1] ] 

U.A[>— l— V+2]j"' > u A[t>- V + 1] ,Sb 

£+1 

EE E n P ^(fe,fc,s)|Si = s b ] (69) 

Define, 

£+1 

F s , z = E n P ^( & >^ s )|Sb = S6]- (70) 

sEN s ,z(uA[b-e-v+2],-" ,UA[i-v+i],s b ) fc=l 

Notice that there are pairs (S, Z) such that V_<2 4 - Using this fact, P[£i(6)] is upper bounded 

by, 

< 3 |V - d '' max F s ,z- (71) 

ZQSQV-d i 

Therefore to show that P[£i(6)] vanishes as n — > 00, it suffices to show that each of Ps,z vanishes as 
n — > 00. To bound above the probability Fs,z, we use the following lemmas which provide an upper bound 
on the probability inside the last summation. 
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Lemma 6: For each s e Ms,z{^-A[b-l-v+2}i " " " > u ^4[6-v+i]j s fc)> we have 

P[£ 3 (&,M|S 6 = s 6 ] <2^-( fc ), 

where (3s,z{k) is given by 

fe(*0= E H ( X t)+ E ff(^^t0-^(^5n£,^nA_JX 5 o n£fc y 2On£fc _ 1 X £fe f £fe - 1 JC (2i y dj ). 

(72) 

Proof: See Appendix [EJ ■ 
Lemma 7: For each (u^[ 6 _£_y +2] , • • ■ , u^[ 6 _y +1] , s 6 ), we have 

Proof: See Appendix [F] ■ 
Applying Lemma [6] and Lemma [7] to ( f70| ) yields, 

F sz < 2-™(nt 1 1 /3 s ^(fc)-ELi^(%nrJ[/r fc \ S c/ cfc c/ dl )-E t - ez A^;^l^0). (73) 
Thus Ps 2 vanishes as n — > oo, provided that 

EfcW - E^SnA^AV^^J - E I(Y t ,;Y t ,\X r ) > 0. (74) 
fc=i fc=i t'e2 

Substituting ( f72| ) in ( f74| ), simplifies it as follows, 

< E^(^t) + E ^(^l^ 1 *) - E (^(^n^^^J^A^^-A^^^- 1 ^^) 
*eS t'ez fe=i 

t+i 

= E (^(^nA) + H(Y Z nC h -i\Xzr\C h -iYznC h -i) ~ H(XsnC h YznC k -i\Xc k \sYc h _ 1 \zX J chY £ ,h-iY d .) 
k=\ 

-H(U Sn c k \U Ck \ s U Ck U dt )) (76) 

i+i 

= E (^(^n£ fc ;^^ ) (£ fc _ 1 \2)u£ fc - 1 l^d 1 ^(£ fe \'5)u£ fe ) ~ I {Y Zn c k _ 1 ]Yznc k - 1 \X d X Ck +iY di Y( yCk _ i \ z ^yj Ck ^i) 

k=l 

-H(U SnCk \U Ck \ s U ck U d J) (77) 

where in (76i and ( f77] ), we have used the fact that XtS are independent and Y t given (X t , Y t ) is independent 
of all the other random variables. Now consider the RHS of ( |77] >. Since i? C 5, it can easily be shown that 
the first term inside the summation takes its minimum for Z = 5 while the second term simultaneously 
takes its maximum for Z = S. On the other side, Z = S corresponds to the probability Ps,s- Hence if 
Fs,s vanishes, then all Fs,z '■ Z C S vanish as n — > oo. Therefore P[£i(6)] vanishes if all Ps,s (<5 C V- di ) 
vanish. Finally, substituting Z = S in (75i results in (52 1, which completes the proof of Lemma |4] 

Remark 3: If there exists only a single destination, one can use the offset encoding scheme of |7j and 
(8J which has less delay compared to the proposed encoding scheme, to prove Lemma |4] In general however, 
since the ordered partitions corresponding to each receiver for reliable decoding are different, it is impossible 
to obtain the same offset encoding scheme for all the destinations. This makes it clear why the encoding 
scheme does not transmit any information in the first V — 1 blocks. 
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B. Removing additional constraints 

This subsection claims that for each di, we can reduce the constraints of (55 1 to the first term of it. 
A special case of our claim about the single relay channel has been studied in [24]. We prove our claim 
by induction on |V_dJ. For |V_dJ = 1, it is true. Now suppose the induction assumption is true for all 
k < |V_dj|. For each Z C V which contains di and each S C Z\{d{\, let 

hf\s) = - H(Y s X s \X z \ s Y zVSu{di}) Y di ) 
Assume there exists a subset T of A c \{di} such that hy l \T) < 0. For each W C V_ d . observe that, 



•> 1 <\W u T) = ht*\T) + <\V - H(Y w X w \X W c\ T Y W c\ {Tu{di}) Y di 



v 



< R 



vv\r ~ ^(^v^yv|^yv c \r^w c \(ru{d i })^ i ) 
= 4 d °(W) (78) 



Using (78 1, (55ai can be simplified as follows: 



H(U s pA\s) < min4 di) (W) 



VDWDS: 



( = } min h^CWUT) 

VDWDS: k ' 



eW- 



< min h\;;Uw\T) 

VDWDS: I'X/ 

= min /iSi(W) (79) 

V\TDWDS: V \l 



where (a) follows from ( |78| ), since S C WUT and ^ 7", and (b) follows from the first inequality in ( f78| ). 

Now by the induction assumption, the last term of ( |79] ) corresponds to the feasibility constraints of the 
reliable transmission of C/4 to node over the cooperative network with the set of nodes V\T. Hence node 
dj can decode C/4, by treating (X7-, I7-) as noise. We note that the encoding scheme, only results in more 
delay rather than a corresponding encoding/decoding scheme for a cooperative networks with the node's 
set V\T. Therefore, the encoding scheme does not need any changes and the decoding is done only with 
respect to the cooperative network with V = V\T. This proves our claim. 

VI. Slepian-Wolf coding over some classes of cooperative networks 

In this section, we extract some corollaries from Proposition [T] and Theorem [2] about semi-deterministic 
network, Aref networks and linear finite-field and state-dependent deterministic networks, for which Propo- 
sition [T] and Theorem [2] (partially) match. 

Definition 5: A cooperative network with one destination d, is said to be semi-deterministic, if each 
node v G V\{d} observes a deterministic function of all the channel inputs and the destination channel 
output, i.e., Y v = f v (X v ,Y d ). 

Remark 4: The semi-deterministic cooperative network is a generalization of semi-deterministic relay 
channel Q and a class of deterministic relay channels, recently defined in [27|. 
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Definition 6: A cooperative network is said to be deterministic, if each node observes a deterministic 
function of all the channel inputs, i.e., Y v = f v (Xy). 

Definition 7: A deterministic network is said to be an Aref network, if each channel output Y v can be 
decomposed into |V| — 1 components (Y V ' jV : v' G V\{v}), where Y V ' tV is a deterministic function of X v >. 
A semi-deterministic network with destination node d, is said to be a semi-deterministic Aref network, if 
each channel output Y v can be decomposed into |V| — 1 components (Y v > >v : v' G V\{u}), where Y V > )V is a 
deterministic function of X v > for v G V-d ana " Y v >^ is a stochastic function of X v i. 

Definition 8: A deterministic network is said to be a linear finite-field deterministic network, if all the 
channel inputs and outputs lie in the same field GF(g) and each channel output can be expressed as a linear 
combination of all the channel inputs. The relation between the channel inputs and the channel outputs can be 
determined via a matrix product, Yy = GXy, where G is called the channel matrix of the network. Gti,Ti 
is a sub-matrix obtained by deleting the rows and columns of G corresponding to 71 and T%, respectively. 

Definition 9: A cooperative network is state-dependent (SD) p9| , if there exists a set of states S such 
that the channel inputs and the channel outputs at each time are related via the current state of the network. 
A SD-cooperative network is said to be deterministic if each node observes a deterministic function of all the 
channel inputs and the state of the network, i.e., Y v = f v (Xy, S). A SD-deterministic network is said to be 
an Aref network, if each channel output Y v can be decomposed into |V| — 1 components (Y V > )V : v' G V\{u}), 
where Y V > ;V is a deterministic function of (X v >, S). A SD-linear finite-field deterministic network is a network 
described by Yy = G(S)Xy, where G(S) is the matrix of coefficients corresponding to state S. 

Proposition 2: The set of DMCS U A can reliably be transmitted over a semi-deterministic network, if 
there exists random variable Q, such that for each S C V, we have : 

H(U S \U A \ S )< min I(X w ;Y W a\X W oQ) (80) 

where the joint p.m.f. of random variables factors as p(q)[Yl ve yP(x v \q)]p(y\;\xy). 

On the other side, multicasting is feasible, only if there exists a joint p.m.f. p(xy) such that 

H(Us\U A \ s )< min I(X w ;Y W o\X W c). (81) 

Proposition 3: The set of DMCS U A can reliably be multicast over a deterministic network, if there 
exists a product distribution Y\ ve y p(x v ) such that for each S C V, we have: 

H(Us\U A \ S ) < min min H(Y W c \X W c) (82) 

On the other side, multicasting is feasible, only if there exists a joint p.m.f. p(xy) such that 

H (U S \U A \ S ) < min min H(Y W c\X W c). (83) 

Remark 5: Comparing the direct part and converse part of Propositions [2] and [3j we see that the sufficient 
conditions partially match to necessary conditions and these conditions completely match together, if we 
can restrict the set of joint p.m.f. in the converse part to the set of product distributions. 

Proposition 4: The set of DMCS U A can reliably be multicast over an Aref network, if and only if, 
there exists a product distribution n^ev P( x v) sucn that for each S C V, we have: 

H(U S \U A \ S ) < min min £ H(Y VjW a) (84) 
Remark 6: This proposition was partially proved in (3) for acyclic Aref networks. 
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Proposition 5: The set of DMCS U A can reliably be transmitted over a semi-deterministic Aref network, 
if and only if, there exists a product distribution n^ev^^) sucn tnat ^ or eacn 5 C V, we have: 

H(Us\U A \ s ) < mm £ I(X v ;Y v>w o) (85) 
- new 

Proposition 6: The set of DMCS U A can reliably be multicast over a linear finite-field deterministic 
network, if and only if, 

H(Us\Ua\s) < min min rankfGvu vuc) log o (86) 

Now, consider the SD-network. In the sequel, assume that the state S is an i.i.d. random process. 

Proposition 7: For reliable multicasting over a SD-deterministic network, if all destinations have the 
state information S, then a sufficient condition is given by, 

VS C A : H(U s \U A \s) < min min H(Y W c\X W c,S) (87) 



Moreover, condition ( [87] ) is a necessary condition for reliable multicasting over a SD-Aref network and 
a SD-linear finite-field deterministic network with state information available at the destinations. In these 



cases, ( [87] ) is simplified to, 

SD-Aref network •.H(Us\Ua\<?) < min min H(Y vW c\S) 

z ~ ~ new 

SD-linear finite-field deterministic network :H(Us\Ua\s) < min min Ee[rank(Gyw w c (S))] logo 

Proof of Propositions 2-7: The direct part of Propositions [2] and [3] follow from Theorem [2] by setting 
Y v = Y v in Theorem [2] because (Yyy : W C V_d) and (lyy : W C V) are deterministic functions of 
(Yd,Xv) and Xy, respectively. The converse part of Propositions [2] and [5] are the direct consequence of 
Proposition [TJ The direct part of Proposition [4] follows from Proposition [3] and the converse is deduced 
from Proposition [Tj as follows: 

H(Us\U A \ S ) < H(Y w a\X w a) 

< H(\J v( zy\)Y Vj yvo) 

< x: h(y v>W ) 

Now, since 1^ yyc depends only on X v , the last term of inequalities only depends on the mariginal p.m.f. of 
the random variables. Thus, we can restrict the set of joint p.m.f. of Proposition Tjto the product distribution, 
which completes the proof of Proposition |4] The direct part of Proposition [5 follows from Proposition [2] 
and the converse part is obtained from Proposition [JJ as follows: 

H(Us\U A \ s ) < I{Xy^-^Y WtW cY W o )W c\Xyvc) (88) 
= 7(Xyv; Y^y^c\Xy^cYy^c y^c) (89) 

<I(X w ;Y w ^c) (90) 
< J2 I(Xv;Y v , W c) (91) 
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where ([89]) follows, because X\\> — Xyyc — Yyycy^c form a Markov chain and (90 1 follows from the fact 
that {XyycYyvc y\ic) — Xyy — Yy^ y^c form a Markov chain and ( |9Tj ) follows, since Y„.w c given X v is 
independent of other random variables. Finally, note that the RHS of ( |9T| ) only depends on the marginal 
p.m.f. of the random variables Xy which implies the converse. 

The direct part of Proposition [6] is deduced from Proposition [3j by computing the RHS of ( [82] ) for 
the product distribution Yl ve yp{x v ), in which each X v is uniformly distributed over the field GF(q). 
The converse follows from Proposition [T] since the product and the uniform distribution simultaneously 
maximized the RHS of ([9J for all W C V. 

The sufficient condition of Proposition [7] is deduced from Theorem [2] by treating the state information 
at each destination as an additional output of the network and the fact that (Yyy : W C V-dJ is a 
deterministic function of (Xy,S). The necessary conditions for the SD-Aref network and the SD-linear 
finite-field deterministic network follow from similar arguments for the converse of these networks without 
state. ■ 



VII. Slepian-Wolf coding over Gaussian cooperative networks 

In the previous section, we focused on some networks for which the cut-set type necessary conditions 
became sufficient conditions at least for product distribution of channel inputs. In this section, we focus 
on the Gaussian networks for which simple forwarding of the observations of each node is impossible. 
Instead, following fl4j , p"5| , each node quantizes its observations at the noise level, then transmits these to 
the destinations. We compute sufficient conditions corresponding to this approach and compare it with the 
necessary conditions. 

Consider a Gaussian cooperative network, in which the received signal y„ is given by, 

y v = ^ h v ' jV ~x. V ' + z v (92) 
v'eV- v 

where h v > iV is a complex number which represents the channel gain from node v' to node v. Furthermore, 
we assume that each node has an average power constraint equal to one on its transmitted signal. Moreover, 
Z v is an i.i.d. complex Gaussian random process with variance a%. Theorem [5] is the main result of this 
section. 

Theorem 3: A set of DMCS U A can reliably be transmitted over a Gaussian network, if for each S C V, 
we have: 

H(U S \U A \ S ) < min min C wf (W -»• W c ) - k w (93) 

where 



C wf (W^W c )= max I(X w ;Y w a\X w a) 



and 



W = n.in{|W|, \W°\} log(l + min{| ^' |wC|} ) + V - 1 



Moreover, Kyy is bounded above by |V — 1. 

On the other side, the multicasting is feasible, only if: 



H(Us\U A \ s ) < min min C wf (W^W°) (94) 
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Remark 7: This theorem establishes the fact that multicasting of all DMCS whose Slepian-Wolf region 

intersects cut-set bound region within a —V — 1 bits, is feasible. 

6 2 

Proof: C w f{W — > W c ) is the capacity of the W x W c MIMO channel with antenna input X w and 
antenna output Yyyo . Now constraint (94 1 is a direct result of Proposition [TJ since there exists an average 
power constraint equal to one at each node v G W. To show ( |93] ), we apply Theorem [2] to the Gaussian 
network. Assume (X v : v £ W) be jointly complex Gaussian random variables with covariance matrix 
IvxV- Let y„ = Y„ + Zt, where Z v is a complex Gaussian random variable with variance equal to (In 
other words, Y v quantizes Y v at the noise level, [15]). Now consider, 

I(Xyy>; Y w a\X w c) = I(Xyy, Y w a Y w c\{ d .y \Xyyc) (95) 
= I{Xw] YdY w c\^ dt -j\X w c) + I(Xw)Yy V c\{ di y\XwcY w c\{ di y) (96) 

where ( [95] ) follows, since Xyj — (Xy^c , lyyc ) — Yyyc form a Markov chain. Next consider, 

I(Xyy, Y w c\{ d .y\Xy\;cYy V a\{ d .y) = I(X W ; Z w c\{ d .y \X w c , Y w c\{ di y + Z W G\{ d .y) (97) 

< h(Z w c\{ d .y) - h(Zyjc\{ d .}\Zyjc\{ d .} + Z W c\{ d .}) (98) 
= I(Z w c\{ d .y; Z w c\{ dz y + Z w c\{ d .}) (99) 

= \W C \ - 1 (100) 



where ( |97j ) follows from the definition of 3^,, ( |98j ) follows from the fact that conditioning does not increase 
entropy and the fact that conditioning on (Yy^c + Zy^c,X^) is equivalent to conditioning on (Zy^c + 
Zyyc,Xx>) and (Zyvc,Zyyc) is independent of X\>. ( 100[ ) follows, because {(Z V ,Z V ) : v 6 W c } are 
independent and Z w and Z„ are complex Gaussian r.v. with the same variance. In a similar way consider, 

I(Yw', Yw\XvY d .Y w c\{ d .y) = I(Zyv] Zyy + Zyv\X V} Z d ^Zyjc + Zy^c) 

= I(Z W ; Z w + Z w ) 

= \W\ (101) 



Next, we derive a slight modified version of Beam-Forming Lemma [15 Appendix F]. C w f(W — > W c ) 
with water-filling is given by 



n 



c wf (w ^w c ) = J2 iog(i + Quh 



i=l 

where n = min(|W|, \W |) and Aj's are the singular values of the channel matrix of the MIMO channel 
and Qa is given by water-filling solution satisfying, J27=i Qu = Following [Tl5] Appendix F, Equations 
140-143], we obtain, 

C wf {W -> W c ) - I(X w ;Y w \X W c) < nlog(l + ^) (102) 

n 

<nlog(— ) (103) 
n 

Finally, comparing ((%]>, ( |100[ ), ( |101| ) and ( |102[ ) we get, 

/(X w ; y di y W o X{di} |X wC ) > C W/ (W -> W c ) - k w (104) 
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Substituting it in ( [49] ), we conclude that the constraint (93 ) is a sufficient condition. Now note that n 6 [1, \) 



Define f(x) = xlog( — ) on [1, / is a convex function and gets its maximum at the end point ~. Hence 

. . X V o 

the RHS of ([103]) is equal to or less than — which results in Kyy < | V — 1. ■ 



VIII. Achievable rate region for cooperative relay networks 

Consider A = V and the sources (U v : v G V) are statistically independent and uniformly distributed 
over the sets M. v = {1, 2, • • • , 2^"}, thus H{U V ) = R v . Substituting these values in Theorem^! we find an 
achievable rate region which is based on the CF, for cooperative relay networks with multicast demands. 

Theorem 4: A V-tuple (R\, i?2> • • • ,Rv) is contained in the achievable rate region of a cooperative 
network with multicast demands at each node dj £ V, if for each SCV the following constraint holds: 

R s < mm v ^n s [l(X w -Y di Y w o\ {di} \X w aQ)-I{Y w -,^ (105) 

where [x]+ = max{x,0} and the joint p.m.f. of (q,x v ,yv,yv) factors as p(q) H veV p(x v \q)p(y v \x v , y v , q)]p{w 

Proof: Let T be the largest subset of V such that the RHS of (|49j> is non-negative subject to each 
5 C T (Note that if two subsets 71 , 7*2 have this property, then T\ U % also has this property, hence T is 
unique.). Substituting R$ = H(Us\Usc) in Theorem [2] yields that I/7- can reliably be multicast, if ( 105 1 
holds. Hence (R%, ■ ■ ■ , Ry) is achievable (Note that R v = for each node v G 7" c ). ■ 
Corollary 1: Consider a relay network with node 1 as a transmitter which has no channel output, i.e., 
Y\ = 0, N — 2 relay nodes {2, • • • , N — 1} and node iV as a destination which has no channel input, i.e., 
Xn = 0- Substituting R2 = ■ ■ ■ = Rn = in Theorem [4] gives the following achievable rate (Rcf) for 
relay network. 

R C f= min [l(X s ;Y s c\ {v} Y N \X s cQ)-I(Ys-,Y s \X v Y v Y s c\ {vy Q)] + (106) 

Remark 8: For the single relay channel, the achievable rate is reduced to the CF rate with time-sharing 
as given in pUJ . 

Remark 9: In fl6|], we obtain an achievable rate based on CF, which subsumes the CF rate given in [8], 



when the partial decoding part of the CF strategy is relaxed. The CF rate in 1 16 Theorem 3] is given by: 

R* CF = I(X r ,Y v Y v _ v \X v _ v ) (107) 

subject to the constraints 

VS C V\{l,V} : IiYslYslXv^YvYso^vy) < I(X s ;Y v Y S c\ {v} \X S c\ {v} ) (108) 
Now let Q = in Corollary [T] It can be easily shown that when the constraints ( 108 1 hold, then S = V 



reaches the minimum of the RHS of ( 106| ). Therefore, the rate of Corollary [T] subsumes the CF-rate given 
in |16j Theorem 3]. 

Corollary 2: Consider a two-way relay network with nodes 1 and V as the two transmitters each demand- 
ing the message of the other one, and V — 2 relay nodes {2, • • • , V — 1}. Substituting R 2 = • • • = Ry-i = 
and Yi = Yy = in Theorem |4| gives the following achievable rate region for the two-way relay network. 

k = l,V: R k = mm [l(X s ; Y S c m Y h \X S o) - I(Y SX{k} ; Y sm \X v Y h Y S a m )} + (109) 



where 1 = V and V = 1. 



26 



Remark 10: Propositions [2J7] are generalizations of several recent works on deterministic relay networks 
including [4, Theorem 3.9], p3| Theorem 4.2], | [T5} Theorem 4.4], [27 , Theorem 1], [28 , Theorem 1] and 



p9] Theorem 1]. 

Next, consider the Gaussian cooperative network. Applying Theorem [3] to Uy, we conclude the following 
corollary which shows that the cut-set bound region is achievable within a constant number of bits. 

Corollary 3: A V-tuple (Ri, R2, ■ ■ ■ ,Rv) is contained in the achievable rate region of a Gaussian 
cooperative network with multicast demands at each node di € V, if for each S C V the following constraint 
holds: 

R s < min min C wf (W -»• W c ) - k w (110) 

where C^/ and Kyy are as defined in Theorem [3] 

Remark 11: In fl3| Theorem 4.6], authors have shown that by quantization at noise level, Gaussian 
relay network achieves the cut-set bound within 14V bits. But Corollary [3] implies that quantization at noise 

level achieves the cut- set bound within —V — 1 bits; thus we have tightened the gap between the achievable 
rate and the cut-set bound. A similar result holds for the two-way Gaussian relay network. 

IX. CONCLUSIONS 

We derived sufficient and necessary conditions for reliable multicasting of DMCS over cooperative 
networks. Necessary conditions were based on the cut-set type outer bound for the relay network. Suf- 
ficient conditions are based on joint source-channel coding, compress and forward strategy for the relay 
network and an identity related to the sub-modularity property of the entropy function. We showed that the 
sufficient conditions are indeed necessary conditions for some classes of deterministic networks including 
Aref networks and the linear finite-field deterministic networks. We also proved that multicasting of DMCS 
whose Slepian-Wolf region intersects the cut-set outer bound within a constant number of bits are feasible. 
In particular, we reduced all results of the paper to obtain achievable rate regions for multiple messages- 
multicast over the cooperative relay networks. We showed that this achievable rate region subsumes some 
recent achievable rate (region) for relay networks. 

Appendix A 
Proof of LemmaQ] 

We prove this lemma by contradiction. Let d be the dimension of P. Suppose F is not a closed covering 
of P, so there exists a point A inside P which is not covered by F (Note that by assumption 2, the points 
that lie on the boundary of P are covered). Let B be the closest point in U™ =1 P, to A. It is clear that B must 
lie on a facet of at least one of the poly topes (Pj : 1 < i < n). Denote this facet by Fp r Two situations 
arise: 

1) Fp^ lies inside P. Now by assumption 3, there exists k / j, such that Pj n P& = Fp^. Let S(B,e) 
be a d-dimensional sphere with center B and radius e which is small enough such that S(B,e) is 
contained in Pj U Pfc. Then the segment AB intersects S(£>, e) at a point C which belongs to one 
of Pj or Pj.. Now C is closer than B to A and lies on U" =1 Pj. This results in contradiction, which 
proves lemma in this case. 

2) Fp lies on the boundary of P. Let S(B,e) be a sphere with center B and radius e which is small 
enough such that S(B, e) only intersects Pj. Since, A lies inside P, the segment AB intersects S(B, e) 
at a point C inside P. By assumption, C belongs to Pj, which again results in contradiction that proves 
the lemma. 
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Appendix B 
Proof of Lemma[2] 

Denote the RHS of ((4]) by F*j T . First, we prove that F^ r C Ff j-. Suppose x belongs to F^ r . Now 
for each U C V, we have: 

= x U nr + %unT c (HI) 

> f(UC\T\{UC\T) c ) + f{UC\T c \T c C\U c ) (112) 

= /(V) - f(U c U T c ) + /(T c ) - f(U c n 7"°) (113) 

>f{V)-f{U c ) (114) 

= /W C ) (115) 



where ( |1 12[ ) follows from the definition of Fyj- and ( 1 14 i follows, since / is a sub-modular function. Now, 
( 1 15| ) yields x € F/-7-. Hence F^^- C Ff t f. On the other side, assume x E F/^. Note that by definition, 



X7- G F/ 1 ^-. For each 5 C T c , consider: 

xs = %TuS ~ %T (116) 

= xtus ~ f{T\T c ) (117) 

> f(TuS\T c nS c ) -f{T\T c ) (118) 

= f(T c )-f(T c nS c ) (119) 

= f(S\T c \S) (120) 

3(2) 



where ( 117 1 follows, because x lies on the hyperplane xj- = f(T\T c ). Now, ( 120| ) implies that xj-c £F^| 



which results in F /)T C F£ T> Thus F* fT = F /jT . 

Next, we show that Fjj- = Ff 1 and F^- = P/ 2 - First observe that since / is sub-modular, f\ and /2 
are sub-modular functions. Hence P * x and P j 2 are well defined. Moreover, note that 

VcS C T : fi(S\T\S) = ft(S U T) - h{T\S) 

= f(SuT\T c )-f(T\S\T c ) 
= f(V)-f([T\S]uT c ) 
= /(V) - f(S c ) 

= f(S\S c ) (121) 



Comparing ( |121| ) and ([5]) with Definition j^J we conclude that F^- is the essential polytope of f\ with 
dimension |7~| — 1. Likewise, we can show that Fjlj- is the essential polytope of f2 with dimension \T C \ — 1. 



This completes the proof. 



Appendix C 



Formal Proof of Equation <[46j> 
By Lemma [2| it suffices to prove the following identities: 

SCT C : h c (S\T c \S) = h C '( y S\S c ) (122) 
SQT: h c (S\S c ) = h c ,(S\T\S) (123) 
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We prove the first identity. Proof of the second identity is similar. For each S C T c consider, 

h c {S\T c \S) = h c (T c ) - h c {T c n S c ) 



(124) 



K+l 



^ H{X r c nCk Y r c nCk l \X C kY C k-i) - H(X T c nS c n c k Y r c nS c n c k _ 1 \X C kY J r k -i) (125) 



fc=i 

A'+l 



/ J H(Xs n c k Ys n c k _ 1 \X r c nS c nCk Y T c nS c nCk _ 1 Xc k Y C k 



(126) 



fc=i 



Note that C* k = C k 1 U (T c n £jfe_i). Moreover, for each S C T c , simple calculations yield: 

s n a = s n [(Tn A-i) u (T c n c k )) =snc k 
s c nc* k = [TnCk-i] u [s c n r c n £ fc ] 



substituting ( 127 1 in ( 126l gives: 



K+l 



/ic(5|T c \5) = ^2 H ( x snciYsnci_ 1 \X s c nCk Y s c nC * k _ i Xc* k Yc*k-iZ) 



k=l 
K+2 



^ ^(^5n£*^n£*_J^5 c n£*Xs c n£*_ 1 ^£* fc ^c* fc - 1 ^) 



fc=l 



h c *(<S\S 



(127) 



(128) 



(129) 



(130) 



where in the last step, we have used the fact that S D £^ , x = 5fl = 0. This completes the proof.D 



Appendix D 



Equivalence of Constraints d55) and (56i 



It is sufficient to show that the RHS of ( 55a ) and ( 56a I are equal. Substituting R v = H (X v )+H (Y V \X V Y V ) 
in the RHS of ( |55a[ ) gives, 

- H(YyvXy\;\Xy\;cY W c\{ d .yY ( ] f .) = H(X W ) + H(Yyv\Xy\;Yyv) - H(Y w X W \X W cY W c\{ dz }Y dz ) 

(131) 

= I{Xw,Y W a\{ d .jY ( i i \Xwc) + H(Yw\XwYw) 

- H(Yyv\XvY W c\{ d .yY ( i.) 
= I(Xyy, Y W c\{ d .yY dt \X W c) - IiXw'i Y\v\XvYw c \{d,}Yd i ) 

(132) 



where ( 131 1 follows from the fact that di ^ W, X t 's are independent and Y t given (X t ,Y t ) is independent 
of all other random variables and ( 132 ) follows, since {Xy^cYy^aY^.) — (Xyv,Yy\?) — Yy> forms a markov 
chain. Substituting (|132[) in (|55a[) shows that (|55a[) and (|56a[) are equal. Also, using (|1 32j> with W = 5 



shows that (55b I and (56b I are equal. 
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Appendix E 
Proof of Lemma[6] 

According to the codebook generation and the definition of Ns,z{^Mb-t-V+2}->' ' ' > u A[b~v+i]i s b) m 
, (Xt(st) : t E 5 n and (Xv(sv,[6-fc+i])) are drawn independently from the sets T™,{X t ) and 
T e n (X v ). Also given yip ,[b—k+l] G ^ H Yc(zc|X t / i s drawn uniformly from the set 

T e n (Yf|X t / fc+i] ) and i s independent from other random variables. Hence the joint p.m.f. of 

(xsn£AX s .sn£j,xv(svjfc-AM-i]);5^zn£ fc _i(^ factors as 

f[^v(sv,[b-k+i]),yc-^u(c k ^\z),lb~k+i],yd„[b-k+i]} n ^x t (xt(st)) Yl P Y t |x t (y*'(^'l x t',[f)-fc+i])) 5 

tesn£ fc i'e^nCn 

(133) 

where Px t and fy- t |X( are un if° rm distributions on the sets T™,{Xt) and T™{Yt\Xt), respectively Now, we 
upper bound ¥[£$(b, k, s)] for each s E A/5,2: (u.A[6-^-v+2] i • • • , u^.[6-v+i], s&) as follows, 

F[£ 3 (b, k,s)] = y^P[x£ fc \s(s £t \sj b _ fc+11 ),y£ fc l \;g^ 

(x£ fc \s(sc fc \sjb-/ t +i]),yc fc _ 1 \z(zc fc _ 1 \z,[b-fc]),x £ fc [6 _ fc+1 py c fc-i [b _ fc+1] ,x d .,y d .)eT e " 

e n p x t (x 4 ( St )) n p^(M^,[b-k+i])) (134) 

T " (x Snc k Y zn£. k _ 1 l*-c k \s,[b-k+ll>yc k _ 1 \Z,[l>->°] S C>>-l[b-k+iy y <li, [b-k+ 1]) 

= I p [( x £ fc vs( s ,c (I vs,[b-fc+i])> Y £ fc 1 \ 2 ( 2; z: fc _ 1 \2,[6-fc])> Y £ fc -i[ b _ fe+1 ],X d . i [ 6 _ fc+1 ], Y d . i [ 6 _ A;+1 ]) G T™] 
I ?T (X 5n ^ Y2n£ fc - 1 1 x c k \s Y c k _ 1 \z x c Y c k - 1 x d ( Ydi ) I 



(135) 

< 2- nps - z{k) (136) 

where ( |134| ) follows from ( |133[ ), ( |135| ) follows from the definition of Py t and Pf t \x t anc ^ 4HH > i s a result 
of the properties of jointly typical sequences. 

Appendix F 
Proof of Lemma|7] 

According to the definition of M s ,zip-A[b-i-V+2\ ■ ■ ■ ,u A[b _ v+1] ,s b ), for s = (w Cl ,z Cl ,- ■ ■ ,w Ce ,z Ce ) 
in ([68]), each (z v : v G Z) takes 2 n( - I &;Yv\x v )+S) _ 1 different values and each (z v : v G V-d\Z) takes a 
fixed value, thus zy_ d takes less than 2 n ^*ez / ( y t; y tl x *)) different values. Also, according to the definition 
for each k G W£ h \s takes the fixed value W£ k \s,[b-k+l] an d w snc k must satisfy the following relation: 

usnc k (wc k ns) G T e n (^c fc (^£ fc ns)|u£ fc \ 5 (w£ fc \ 5 j b _£; + i]), U£fc i[fe _ fc _y +2 ], u d . ) [ 6 _ fc _ v+2 ]). 

Thus u,5 n £ fc (tu£ fc n.s) ( or equivalently wc k ns) takes at most 2 n( - H ^ Usnc ^ Uc ^ sU ^ kUd i^ different values. There- 
fore, wy_ d , takes at most 2 n ^ k =i H ( u snc k \Uc k \sU ck u di )) different values. Now, comparing the bounds on the 
number of possible choices for zy_ d and Wy_ d yields the lemma. 
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